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The minimum-cost flow (MCF) problem is a fundamental optimization problem 
with many applications and seems to be well understood. Over the last half cen¬ 
tury many algorithms have been developed to solve the MCF problem and these 
algorithms have varying worst-case bounds on their running time. However, these 
worst-case bounds are not always a good indication of the algorithms’ performance 
in practice. The Network Simplex (NS) algorithm needs an exponential number 
of iterations for some instances, but it is considered the best algorithm in practice 
and performs best in experimental studies. On the other hand, the Minimum-Mean 
Cycle Canceling (MMCC) algorithm is strongly polynomial, but performs badly in 
experimental studies. 

To explain these differences in performance in practice we apply the framework 
of smoothed analysis. We show an upper bound of 0(mn^ log(n) log((/>)) for the 
number of iterations of the MMCC algorithm. Here n is the number of nodes, m 
is the number of edges, and 4> is a parameter limiting the degree to which the edge 
costs are perturbed. We also show a lower bound of H(m log((/i)) for the number 
of iterations of the MMCC algorithm, which can be strengthened to H(mn) when 
4> = 0(n^). For the number of iterations of the NS algorithm we show a smoothed 
lower bound of H(m • min{n, (p} ■ cp). 


1 Introduction 

The minimum-cost flow (MCF) problem is a well-studied problem with many applications, 
for example, modeling transportation and communication networks 00 ■ Over the last half 
century many algorithms have been developed to solve it. The first algorithms proposed in the 
1960s were all pseudo-polynomial. These include the Out-of-Kilter algorithm by Minty 
and by Fulkerson [^, the Cycle Canceling algorithm by Klein [^, the Network Simplex (NS) 
algorithm by Dantzig [^, and the Successive Shortest Path (SSP) algorithm by Jewell [11| , 
Iri [^, and Busacker and Gowen [^. In 1972 Edmonds and Karp proposed the Capac¬ 
ity Scaling algorithm, which was the first polynomial MCF algorithm. In the 1980s the first 
strongly polynomial algorithms were developed by Tardos and by Orlin [18] . Later, sev¬ 
eral more strongly polynomial algorithms were proposed such as the Minimum-Mean Cycle 
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Canceling (MMCC) algorithm by Goldberg and Tarjan and the Enhanced Capacity Scaling 
algorithm by Orlin [^, which currently has the best worst-case running time. For a more 
complete overview of the history of MCE algorithms we refer to Ahuja et al. [^. 

When we compare the performance of several MCE algorithms in theory and in practice, we 
see that the algorithms that have good worst-case bounds on their running time are not always 
the ones that perform best in practice. Zadeh showed that there exist instances for which 
the Network Simplex (NS) algorithm has exponential running time, while the Minimum-Mean 
Cycle Canceling (MMCC) algorithm runs in strongly polynomial time, as shown by Goldberg 
and Tarjan [^. In practice however, the relative performance of these algorithms is completely 
different. Kovacs 15 showed in an experimental study that the NS algorithm is much faster 
than the MMCC algorithm on practical instances. In fact, the NS algorithm is even the fastest 
MCE algorithm of all. An explanation for the fact that the NS algorithm performs much better 
in practice than indicated by its worst-case running time is that the instances for which it needs 
exponential time are very contrived and unlikely to occur in practice. To better understand 
the differences between worst-case and practical performance for the NS algorithm and the 
MMCC algorithm, we analyze these algorithms in the framework of smoothed analysis. 

Smoothed analysis was introduced by Spielman and Teng [22] to explain why the simplex 
algorithm usually needs only a polynomial number of iterations in practice, while in the worst 
case it needs an exponential number of iterations. In the framework of smoothed analysis, an 
adversary can specify any instance and this instance is then slightly perturbed before it is used 
as input for the algorithm. This perturbation can model, for example, measurement errors or 
numerical imprecision. In addition, it can model noise on the input that can not be quantified 
exactly, but for which there is no reason to assume that it is adversarial. Algorithms that have 
a good smoothed running time often perform well in practice. We refer to two surveys [l6, 23 
for a summary of results that have been obtained using smoothed analysis. 

We consider a slightly more general model of smoothed analysis, introduced by Beier and 
Vocking [^. In this model the adversary can not only specify the mean of the noisy parameter, 
but also the type of noise. We use the following smoothed input model for the MCE problem. 
An adversary can specify the structure of the flow network including all nodes and edges, 
and also the exact edge capacities and budgets of the nodes. However, the adversary can not 
specify the edge costs exactly. For each edge e the adversary can specify a probability density 
Qe '■ [0,1] —)• [0, (f)] according to which the cost of e is drawn at random. The parameter cj) 
determines the maximum density of the density function and can therefore be interpreted as 
the power of the adversary. If cj) is large, the adversary can very accurately specify each edge 
cost and we approach worst-case analysis. \i (j) = 1, the adversary has no choice but to specify 
the uniform density on the interval [0,1] and we have average-case analysis. 

Brunsch et al. were the first to show smoothed bounds on the running time of an MCE 
algorithm. They showed that the SSP algorithm needs 0{mn(f)) iterations in expectation and 
has smoothed running time 0{mn4>{m + nlog((^))), since each iteration consists of hnding a 
shortest path. They also provide a lower bound of Q{m ■ minjre, cj)} ■ 4>) for the number of 
iterations that the SSP algorithm needs, which is tight for (f> = n(n). These bounds show that 
the SSP algorithm needs only a polynomial number of iterations in the smoothed setting, in 
contrast to the exponential number it needs in the worst case, and explains why the SSP algo¬ 
rithm performs quite well in practice. In order to fairly compare the SSP algorithm with other 
MGF algorithms in the smoothed setting, we need smoothed bounds on the running times of 
these other algorithms. Brunsch et al. asked particularly for smoothed running time bounds 
for the MMCG algorithm, since the MMGG algorithm has a much better worst-case running 
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time than the SSP algorithm, but performs worse in practice. It is also interesting to have 
smoothed bounds for the NS algorithm, since the NS algorithm is the fastest MCF algorithm 
in practice. However, until now no smoothed bounds were known for other MCF algorithms. 
In this paper we provide smoothed lower and upper bounds for the MMCC algorithm, and a 
smoothed lower bound for the NS algorithm. 

For the MMCC algorithm we provide an upper bound (Section]^ for the expected number 
of iterations that the MMCC algorithm needs of 0{mv? log(n) log((/i)). For dense graphs, this 
is an improvement over the 0(m^n) iterations that the MMCC algorithm needs in the worst 
case, if we consider cj) a constant (which is reasonable if it models, for example, numerical 
imprecision or measurement errors). 

We also provide a lower bound (Section 3.1) on the number of iterations that the MMCC 
algorithm needs. For every n, every m G {n, n + 1,..., n^}, and every cj) < 2*^, we provide an 
instance with 0(n) nodes and 0(m) edges for which the MMCC algorithm requires H(m log((/))) 
iterations. For (j) = H(n^) we can improve our lower bound (Section 3.2). We show that for 
every n > 4 and every m G {n, n + 1,..., n^}, there exists an instance with 0(n) nodes and 
0(m) edges, and (j) = for which the MMCC algorithm requires ^l(mn) iterations. This 

is indeed a stronger lower bound than the bound for general 4>, since we have mlog{4>) = 
0(mlog(n)) for (j) = 0(n^). 

For the NS algorithm we provide a lower bound (Section]^ on the number of non-degenerate 
iterations that it requires. In particular, we show that for every n, every m G {n,..., n^}, and 
every <() < 2”’ there exists a flow network with 0(n) nodes and 0(m) edges, and an initial 
spanning tree structure for which the NS algorithm needs H(m • min{n, 0} • (j)) non-degenerate 
iterations with probability 1. The existence of an upper bound is our main open problem. Note 
that our bound is the same as the lower bound that Brunsch et al. found for the smoothed 
number of iterations of the SSP algorithm. This is no coincidence, since we use essentially the 
same instance (with some minor changes) to show our lower bound. We show that with the 
proper choice of the initial spanning tree structure for the NS algorithm, we can ensure that 
the NS algorithm performs the same flow augmentations as the SSP algorithm and therefore 
needs the same number of iterations (plus some degenerate ones). 

In the rest of our introduction we introduce the MCF problem, the MMCC algorithm and 
the NS algorithm in more detail. In the rest of our paper, all logarithms are base 2. 


1.1 Minimum-Cost Flow Problem 

A flow network is a simple directed graph G = {V, E) together with a nonnegative capacity 
function u : E ^ M+ defined on the edges. For convenience, we assume that G is connected and 
that E does not contain a pair (u, v) and {v, u) of reverse edges. For the MCF problem, we also 
have a cost function c : F —)■ [0,1] on the edges and a budget function 5 : F —)■ M on the nodes. 
Nodes with negative budget require a resource, while nodes with positive budget offer it. A 
flow f : E ^ M_|_ is a nonnegative function on the edges that satisfies the capacity constraints, 
0 < /(e) < u{e) (for all e G E), and flow conservation constraints b{v) -|- 'f2e={uv)eE ~ 
Yle'=(v w)&E ^ ^ ^)- ^ f defined as the sum of the flow 

on each edge times the cost of that edge, that is, c(/) = Yle&E ' /(^)- The objective of the 
minimum-cost flow problem is to find a flow of minimum cost or conclude that no feasible flow 
exists. 

In onr analysis we often use the concept of a residual network, which we define here. For 
an edge e = {u,v) we denote the reverse edge {v,u) by e~^. For flow network G and flow 
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/, the residual network Gf is defined as the graph Gj = {V^EfUE}^). Here = {e | e G 
E and /(e) < M(e)} is the set of forward edges with capacity u'{e) = u{e) — /(e) and cost 
c'(e) = c(e). -Eb = {e I e~^ G E and /(e“^) > 0} is the set of backward edges with capacity 
u'{e) = /(e“^) and cost c'(e) = —c(e“^). Here u'{e) is also called the residual capacity of edge 
e for flow /. 


1.2 Minimum-Mean Cycle Canceling Algorithm 

The MMCC algorithm works as follows: 

• First we find a feasible flow using any maximum-flow algorithm. 

• Next, as long as the residual network contains cycles of negative total cost, we find a 
cycle of minimum-mean cost and maximally augment flow along this cycle. 


We stop when the residual network does not contain any cycles of negative total cost. 


For a more elaborate description of the MMCC algorithm, we refer to Korte and Vygen [14| . 
In the following, we denote the mean cost of a cycle C by /i(C) = (X]egc'c(e)) /\G\. Also, 
for any flow /, we denote the mean cost of the cycle of minimum-mean cost in the residual 
network G/ by ^(/). 

Goldberg and Tarjan proved in 1989 that the Minimum-Mean-Cycle Canceling algorithm 
runs in strongly polynomial time. Five years later Radzik and Goldberg 21 slightly improved 


this bound on the running time and showed that it is tight. In the following we will focus on 
the number of iterations the MMCC algorithm needs, that is, the number of cycles that have 
to be canceled. A bound on the number of iterations can easily be extended to a bound on the 
running time, by noting that a minimum-mean cycle can be found in 0{nm) time, as shown 
by Karp |12| . The tight bound on the number of iterations that the MMCC algorithm needs 
is as follows. 


Theorem 1.1 (Radzik and Goldberg). The number of iterations needed by the MMCC algo¬ 
rithm is bounded by 0{nw?‘) and this bound is tight. 

To prove our smoothed bounds in the next sections, we use another result by Korte and 
Vygen |14[ Corollary 9.9] which states that the absolute value of the mean cost of the cycle 
that is canceled by the MMCC algorithm, |/i(/)|, decreases by at least a factor 1/2 every mn 
iterations. 


Theorem 1.2 (Korte and Vygen). Every mn iterations of the MMCC algorithm, |/u(/)| de¬ 
creases by at least a factor 1/2. 


1.3 Network Simplex Algorithm 

The Network Simplex (NS) algorithm starts with an initial spanning tree structure (T, L, U) 
and associated flow /, where each edge in E is assigned to exactly one of T, L, and U, and it 
holds that 

• /(e) = 0 for all edges e G L, 

• /(e) = u{e) for all edges e G U, 
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• 0 < /(e) < u{e) for all edges e £ T, 

• the edges of T form a spanning tree of G (if we consider the undirected version of both 
the edges of T and the graph G). 


If the MCF problem has a feasible solution, such a structure can always be found by first 
finding any feasible flow and then augmenting flow along cycles consisting of only edges that 
have a positive amount of flow less than their capacity, until no such cycles remain. Note that 
the structure (T, L, U) uniquely determines the flow /, since the edges in T form a tree. In 
addition to the spanning tree structure, the NS algorithm also keeps track of a set of node 
potentials 7r(u) for all nodes v £ V. The node potentials are defined such that the potential of 
a specified root node is 0 and that the potential for other nodes is such that the reduced cost 
c"{u, v) = c(n, u) — tt{u) + Tr{v) of an edge {u, v) equals 0 for all edges (u, v) £ T. 

In each iteration, the NS algorithm tries to improve the current flow by adding an edge to 
T that violates its optimality condition. An edge in L violates its optimality condition if it 
has strictly negative reduced cost, while an edge in U violates its optimality condition if it 
has strictly positive reduced cost. One of the edges e that violates its optimality condition is 
added to T, which creates a unique cycle C in T. Flow is maximally augmented along C, until 
the flow on one of the edges e' £ G becomes 0 or reaches its capacity. The edge e' leaves T, 
after which T is again a spanning tree of G. Next we update the sets T, L, and U, the flow 
and the node potentials. This completes the iteration. If any edges violating their optimality 
condition remain, another iteration is performed. One iteration of the NS algorithm is also 
called a pivot. The edge e that is added to T is called the entering edge and the edge e' that 
leaves T is called the leaving edge. Note that in some cases the entering edge can be the same 
edge as the leaving edge. Also, if one of the edges in the cycle G already contains flow equal 
to its capacity, the flow is not changed in that iteration, but the spanning tree T still changes. 
Such an iteration we call degenerate. 

Note that in each iteration, there can be multiple edges violating their optimality condition. 
There are multiple possible pivot rules that determine which edge enters T in this case. In 
our analysis we use the (widely used in practice) pivot rule that selects as the entering edge, 
from all edges violating their optimality condition, the edge for which the absolute value of its 
reduced cost |c’^(e)| is maximum. In case multiple edges in C are candidates to be the leaving 
edge, we choose the one that is most convenient for our analysis. 

If a strongly feasible spanning tree structure is used, it can be shown that the number of 
iterations that the NS algorithm needs is finite. However, Zadeh [25| showed that there exist 
instances for which the NS algorithm (with the pivot rule stated above) needs an exponential 
number of iterations. Orlin 20 developed a strongly polynomial version of the NS algorithm, 
which uses cost-scaling. However, this algorithm is rarely used in practice and we will not 
consider it in the rest of our paper. For a more elaborate discussion of the NS algorithm we 
refer to Ahuja et al. [^. 


2 Upper Bound for the MMCC Algorithm 

In this section we show an upper bound of 0(mn^ log(n) log((/))) for the expected number of 
iterations that the MMCC algorithm needs starting from the initial residual network G^ for 

that G is simple and that E does not contain a pair {u,v) and {v,u) of reverse edges. This 


the feasible starting flow / for flow network G = (V,E). Note that we assumed in Section 


1.1 
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implies that for each pair of nodes u,v G V, there is always at most one edge from n to u and 
at most one edge from t? to tt in any residual network Gf. We first show that the number of 
cycles that appears in at least one residual network Gf for a feasible flow / on G, is bounded 
by (n + 1)!, where n = |y|. 

Lemma 2.1. The total number of cycles that appears in any residual network Gf for a feasible 
flow f on G, is bounded by (n + 1)!. 

Proof. First we show that the number of directed cycles of length k (2 < k < n) is bounded 
by n!. We identify each cycle G of length k with a path P of length k starting and ending at 
the same arbitrarily chosen node of G and then following the edges of G in the direction of 
their orientation. Every such path P can be identihed with a unique cycle. The number X of 
possible paths of length k is bounded by 

X < n • (n — 1) • ... • (n — A: + 1) < n! (1) 

The first inequality of Equation Q follows since there are at most n possible choices for the 
hrst node of the path, at most n — 1 choices for the second node, etc. 

The lemma follows by observing that the number of possible lengths of cycles in residual 
networks Gf is bounded by re + 1. □ 

We next show that the probability that any particular cycle has negative mean cost close to 
0 can be bounded. In the rest of this section, e > 0. 

Lemma 2.2. The probability that an arbitrary cycle G has mean cost fr{C) G [—e)0[ can be 
bounded by necp. 

Proof. We can only have p,{G) G [—e, 0[ if c(G) G [—ree, 0[, since G consists of at most re edges. 
We now draw the costs for all edges in G except for the cost of one edge e. The cost of cycle 
G depends linearly on the cost of edge e, with coefficient 1 if e is a forward edge in G and 
coefficient —1 if e is a reverse edge in G. Therefore, the width of the interval from which the 
cost of e must be drawn such that c(C') G [—ree, 0[ is ree. Since the density function according 
to which the cost of e is drawn has maximum density (j), the probability that the cost of e is 
drawn from this interval is at most nef. □ 


Corollary 2.3. The probability that there exists a cycle G with p,{C) G [—e,0[ is at most 

(re + ly.nef). 

Proof. The corollary follows directly from Lemma |2.1| and Lemma |2.2[ □ 

Lemma 2.4. If none of the residual networks Gf for feasible flows f on G contain a cycle G 
with pl{C) G [—e,0[, then the MMCC algorithm needs at most rrere|'log 2 (l/e)] iterations. 


Proof. Assume to the contrary that none of the residual networks Gf for feasible flows / 
on G contain a cycle G with p,{G) G [—e, 0[, but the MMCC algorithm needs more than 
rrere[log 2 (l/e)] iterations. Let / denote the starting flow found using any maximum-flow al¬ 
gorithm. Since all edge costs are drawn from the interval [0,1], we have that |//(/)| < 1- 
According to Theorem 1.2 after mre[log 2 (l/e)] iterations we have that for the current flow 
/ holds that |/u(/)| < s. Now either /r(/) > 0 contradicting the assumption that the MMCC 
algorithm needs more than 7rere[log2(l/e)] iterations, or fj.{f) G [—e,0[ contradicting the as¬ 
sumption that none of the residual networks Gf for feasible flows f on G contain a cycle G 
with p.{G) G [—e,0[. □ 
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Theorem 2.5. The expected number of iterations that the MMCC algorithm needs is at most 
0{mn^ log(n) log((?!))). 

Proof. Let T be the expected number of iterations that the MMCC algorithm needs. We have 


OO 

E(r) = ^p(r > t) 

t=l 

OO 

< P(any Gf contains a cycle C with pl{C) e , 0[) (2) 

t = l 

OO 

< mn^ l'log(n)] |'log((^)] + ^ (n + (3) 

t=mn? I'log(n)] |'log((/))] +1 

OO 

< mn^ l'log(n)] |'log((?!))] + ^ (4) 

t=o 

OO 

= mn^ l'log(n)] |'log((^)] + mn 2“* 

t=o 

= 0 {mn^ log(n) log(i;i))) 


Here Equation ([^ follows from Lemma 2.4 Equation Q follows by bounding the probability 
for the first mn^[log(n)] [log(i?i)] terms of the summation by 1 and the probability for the 
other terms using Corollary ( |2.3[ ). Finally, Equation Q follows from the inequality nlog(n) > 
log((n + 2)!), which holds for n > 6. 

□ 


3 Lower Bound for the MMCC Algorithm 

3.1 General Lower Bound 

In this section we describe a construction that, for every n, every m G {n, n + l,...,n^}, 
and every (f) < 2"^, provides an instance with 0(n) nodes and 0(m) edges for which the 
MMCC algorithm requires 0(m log((/))) iterations. For simplicity we describe the initial residual 
network G, which occurs after a flow satisfying all the budgets has been found, but before the 
first minimum-mean cycle has been canceled. For completeness, we will explain at the end of 
the description of G how to choose the initial network, budgets, and starting flow such that G 
is the first residual network. 

We now describe how to construct G given n, m, and (f. In the following, we assume 4> > 64. 
If (j) is smaller than 64, the lower bound on the number of iterations reduces to H(m) and 
a trivial instance with 0(n) nodes and 0(m) edges will require II(m) iterations. We define 
ku) = [^(log(i;i)) — 4)J and kx = [|(log((^) — 5)J. Note that this implies that kx = k^, or 
kx = kw — For the edge costs we define intervals from which the edge costs are drawn 
uniformly at random. We define G = {V,S) as follows (see Figure [^. 

• V = {a, b,c,d} U U Li V Li W U X, where U = {ui, ..., Un}., V = {ui, ..., u„}, W = 
{wi,... and X = {xi,.. .,XkJ. 
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• £ = EuvU EaU EbU EcU EdU E^U E^. 


• Euv is an arbitrary subset of ?7 x F of cardinality m. Each edge {ui,Vj) has capacity 1 
and cost interval [0, !/(/>]. 

• Ea contains the edges {a,Ui), Ei, contains the edges Ec contains the edges {c,Vi), 

and Ed contains the edges {vi,d) {i = 1,... ,n). All these edges have infinite capacity 
and cost interval [0, !/(/>]. 

• Eyj contains the edges {d,Wi) and {wi,a) {i = 1,...,An edge {d,Wi) has ca¬ 
pacity m and cost interval [0, !/(/>]. An edge {wi,a) has capacity m and cost interval 
[_22-2*^_22-2* + i/^]_ 

• Ex contains the edges (6, Xi) and (xj, c) (i = 1,..., kx)- An edge (6, Xi) has capacity m and 
cost interval [0, !/</>]. An edge (xj, c) has capacity m and cost interval [—2^“^®, —2^“^® -|- 
l/^]. 

Note that all cost intervals have width l/(f> and therefore correspond to valid probability 
densities for the edge costs, since the costs are drawn uniformly at random from these intervals. 
The edges of the types {wi, a) and (xj, c) have a cost interval that corresponds to negative edge 
costs. The residual network with these negative edge costs can be obtained by having the 
following original instance (before computing a flow satisfying the budget requirements): All 
nodes, edges, costs and capacities are the same as in G, except that instead of the edges of 
type {wi,a) we have edges {a,Wi) with capacity m and cost interval [2^“^® — 1/0,2^“^*] and 
instead of the edges of type (xj,c) we have edges (c, Xj) with capacity m and cost interval 
[2^“^® — 1/0, 2^“^*]. In addition, node a has budget k^m, node c has budget kxm, the nodes of 
the types Wi and x* have budget —m and all other nodes have budget 0. If we now choose as 
the initial feasible flow the flow that sends m units from a to each node of type Wi and from c 
to each node of type Xj then we obtain the initial residual network G. 

We now show that the MMCC algorithm needs n(mlog(0)) iterations for the initial residual 
network G. First we make some basic observations. The minimum-mean cycle G never contains 
the path Pj = (d, Wj,a) if the path Pi = (d, Wi,a) has positive residual capacity for some i < j, 
since the mean cost of G can be improved by substituting Pj by Pi in G. Analogously, G 
never contains the path Pj = {b,Xj,c) if the path Pi = {b,Xi,c) has positive residual capacity 
for some i < j. Also, since all cycles considered have mean cost strictly less than 1/0, cycles 
will never include more edges with cost at least —1/0 than necessary. In addition, since the 
edges of type {wi,a) and (xi,c) are saturated in the order cheapest to most expensive, none 
of these edges will ever be included in reverse direction in the minimum-mean cycle. The 
above observations lead to three candidate types for the minimum-mean cycle: cycles of type 
(d, Wi, a, u, V, d), of type (6, Xj, c, v, u, b), and of type (d, Wi, a, u, b, Xj,c, v, d). Here u and v are 
arbitrary nodes in U and V, respectively. In the following series of lemmas we compare the 
mean costs of these cycle types. Here u and v are again arbitrary nodes in U and V, possibly 
different for the cycles that are compared. In our computations we always assume worst-case 
realization of the edge costs, that is, if we want to show that a cycle Gi has lower mean cost 
than a cycle C 2 , we assume that all edges in Gi take the highest cost in their cost interval, 
while all edges in G 2 take the lowest cost in their cost interval (an edge that appears in both 
Cl and C 2 can even take its highest cost in Ci and its lowest cost in C 2 in the analysis). 
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Lemma 3.1. The cycle Ci = {d,Wi,a,u,v,d) has lower mean east than the cycle C 2 = 
(b,Xi,c,v,u,b). 

Proof. Since the cycles have equal length, we can compare their total costs instead of their 
mean costs. We have 


w{Ci) - w{C 2 ) < (-22-2* ^ 5 /^) _ (_ 2 l- 2 i _ 

< —QAjcj) + 6/(/) < 0 

Here the second inequality holds since i < kx < ^(log((/)) — 5). □ 

Lemma 3.2. The cycle Ci = {b,Xi,c,v,u,b) has lower mean cost than the cycle C 2 = 
{d, Wi+i,a,u,v,d). 

Proof. Since the cycles have equal length, we can compare their total costs instead of their 
mean costs. We have 

wiCi) - wiC 2 ) < (- 2 I- 2 * + 4/(j)) - (^_22-2h+i)^ 

< —64/(/) + 4/(f> < 0 

Here the second inequality holds since i + 1 < A;^^, < 2 (log(</>) — 4)- □ 

Lemma 3.3. The cycle Ci = {d,Wi,a,u,v,d) has lower mean eost than the cycle C 2 = 
(d, Wi, a, u, b, Xi, c, v, d). 


Proof. We have 


<"(Ci)/|Ci| - «>(C2)/|C2| < (-2^-“ + /5 - (-2"-” - 2'-“) /8 




Here the second inequality holds since i < kx < ^(log((/)) — 5). □ 

Lemma 3.4. The cycle Ci = {b,Xi,c,v,u,b) has lower mean cost than the cycle C 2 = 
(b,Xi,c,v,d, Wi+i,a,u,b). 


Proof. We have 


w{C,)/\C,\ 


w{C 2 )/\C 2 \< -2^-2* + ^ )/5 - (-2 


, 1 - 2 * 


- 5 ^ + ^^“ 



Here the second inequality holds since i + I < k^j < 2 (log((/>) — 4). □ 

The above observations and lemmas allow us to determine the number of iterations that the 
MMCC algorithm needs for residual network G. 


Theorem 3.5. The MMCC algorithm needs m{kw + kx) iterations for residual network G, 
independent of the realization of the edge eosts. 
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Proof. For the first iteration only cycles of the types {d, Wi, a, u, v, d) and (d, Wi, a, u, b, Xi, c, v, d) 


are available. According to Lemma 3.3, all cycles of type [d, Wi,a,u,v, d) have lower mean costs 
than all cycles of type {d,Wi,a,u,b,Xi,c,v,d) and therefore the first iteration will augment 
along a cycle of type {d,Wi,a,u,v,d). After the first iteration, an edge from V to U will 


become available, and therefore a cycle of the type {b,Xi,c,v,u,b). According to Lemma 3.1 


this cycle has higher mean cost than cycles of the type {d, Wi, a, u, v, d) however, and therefore 
the first m iterations will be of the type (d, Wi, a, u, v, d). 

After the first m iterations, the edge (d, tci), the edge {wi,a), and all edges in Euv will 
be saturated. The available cycle types are now (6, Xi, c, v, u, b) and (6, x*, c, n, d, tCj+i, a, u, b). 


According to Lemma 3.4, all cycles of type {b,Xi,c,v,u,b) have lower mean cost than all 
cycles of type {b,Xi,c,v,d,Wi^i,a,u,b). The next iteration will therefore augment along a 
cycle of type {b,Xi,c,v,u,b). After this iteration, an edge from U to V becomes available 


and therefore a cycle of type (d, rcj+i, a, u, v, d), but according to Lemma 3.2 all cycles of type 


{b,Xi,c,v,u,b) have lower mean cost than cycles of type {d,Wi+i,a,u,v,d) and therefore in 
iterations m + 1,..., 2m the MMCC algorithm augments along cycles of type (6, xt, c, v, u, b). 

After 2m iterations, we obtain G, except that now edges (d, tci), (u)i,a), {b,xi), and (xi,c) 
are saturated and that there is some flow on the infinite capacity edges of the types (a, Uj), 
{ui, b), (c, Vi), and {vi, d). The MMCC algorithm will keep augmenting among m cycles of type 
(d, Wi, a, u, V, d) followed by m cycles of type {b, Xi, c, v, u, b) until all edges of types {wi, a) and 
(xi,c) are saturated and no negative cost cycles remain. The total number of iterations the 
MMCC algorithm needs is therefore m(kw + kx). □ 

The instance G and Theorem 13.51 allow us to state a lower bound on the number of iterations 
that the MMCC algorithm needs in the smoothed setting. 

Theorem 3.6. For every n, every m G {n,n + 1,... ,n^}, and every cj) < 2”, there exists an 
instance with 0(n) nodes and 0(m) edges for which the MMCC algorithm requires Fl{m\og{(j))) 
iterations, independent of the realization of the edge costs. 


Proof. Follows directly from the instance G, Theorem 3.5 and the definition of k^ and kx. □ 


3.2 Lower Bound for 0 Dependent on n 


In Section 3.1 we considered the setting where (f does not depend on n. In this setting we 
showed that the MMCC algorithm needs n(mlog(i^)) iterations. We can improve the lower 
bound if 4> is much larger than n. In this section we consider the case where cj) = PL{n^). In 
particular, we show that for every n > 4 and every m G {n,... ,n^} there exists an instance 
with 0(n) nodes, 0(m) edges, and cj) = 0(n^) for which the MMCC algorithm needs Q{mn) 
iterations. 

The initial residual network H that we use to show our bound is very similar to the initial 
residual network G that was used to show the bound in Section IQ Below we describe the 
differences (see Figure]^ for an illustration). We set cj) = 400000n^. The constant of 400000 is 
large, but for the sake of readability and ease of calculations we did not try to optimize it. 


The node set W now consists of n nodes {rci ,... ,Wn} and the node set X now consists 
of n nodes {xi,... ,Xn}. 
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Figure 1: The instance G for which the MMCC algorithm needs 0(m log(0)) iterations for 
n = 4, m = 9, and 4> = 64. Next to the edges are the approximate edge costs. 


• Node a is split into two nodes oi and 02 - From node oi to 02 there is a directed path 
consisting of n edges, all with infinite capacity and cost interval [0,1/0]. Edges (a, Ui) are 
replaced by edges (a 2 ,rtj) with infinite capacity and cost interval [0,1/0]. Edges {wi,a) 
are replaced by edges {wi, ai) with capacity m and cost interval 


• Node c is split into two nodes ci and C 2 . Erom node ci to C 2 there is a directed path 
consisting of n edges, all with infinite capacity and cost interval [0,1/0]. Edges (c, Vi) are 
replaced by edges (c 2 , Vi) with infinite capacity and cost interval [0,1/0]. Edges (xi, c) are 
replaced by edges (xj, ci) with capacity m and cost interval [—(^^)^* ^ 


Note that this is a valid choice of cost intervals for the edges {wi,ai) and (xj,ci) and that 
they all have negative costs, since (xn,ci) is the most expensive of them and we have 


/ o \ 2n—1 

- 


As in Section [ 3 .1[ there are three candidate types for the minimum-mean cost cycle: cycles 
of type {d, w, a, u, v, d), cycles of type (6, x, c, v, u, b), and cycles of type (d, w, a, u, b, x, c, v, d). 
Again we assume worst-case realizations of the edge costs and compare the mean costs of cycles 
of the different types in a series of lemmas. 


Lemma 3.7. The cycle Ci = {d,Wi,a,u,v,d) has lower mean cost than the cycle C 2 = 
(J},Xi,c,v,u,b). 
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Proof. Since the cycles have equal length, we can compare their total costs instead of their 
mean costs. We have 


w{Ci) - w{C 2 ) < - 


n — 3 


n 


2i-2 


+ 


n + 5 




n — 3 


n 


2i-l 


3e n + 6 
<-+ —— < 0 


n 


(p 


Here the second inequality holds since i < n and — < —e for n > 4. 


□ 


Lemma 3.8. The cycle Ci = {b,Xi,c,v,u,b) has lower mean cost than the cycle C 2 = 
{d, Wi+i,a,u,v,d). 

Proof. Since the cycles have equal length, we can compare their total costs instead of their 
mean costs. We have 


w{Ci) - w{C 2 ) < - 


n — 3 \^* 1 ^_|_4 

n 


^_3.2(m)-2N 


3e n + 4 
<-+ —— < 0 


n 


</> 


n 


Here the second inequality holds since i + 1 < n and — < —e for n > 4. 


□ 


Lemma 3.9. The cycle Ci = {d,Wi,a,u,v,d) has lower mean eost than the cycle C 2 = 
{d, Wi, a, u, b, Xi, c, v, d). 


Proof. We have 


w{Ci)/\Ci\-w{C 2 )/\C 2 \ < 


n + 5 2n + 8 


< —e 


-12 


n + 15 


+ - < 0 


_(n + 5)(2n + 8)ny (f 
Here the second inequality holds since i < n and — for n > 4. 


□ 


Lemma 3.10. The cycle Ci = {b,Xi,c,v,u,b) has lower mean cost than the eyele C 2 = 
(b,Xi,c,v,d, Wi+i,a,u,b). 


Proof. We have 


w{Ci)/\Ci\-w{C 2 )/\C 2 \ < 


< —e 


^^ n —3 ^ 1 _j_ n+4 ^ ^^ n—3 ^ 1 ^ n—3 ^ ^(^+1) 2^ 

n + 5 2n + 8 

n + 15 


-12 


Here the second inequality holds since i + 1 < n and — ^ < —e for n > 4. 


n + 4 

(n + 5)(2n + 8)n J ^ {n + 5)(j) ^ 

=3\2ri ^ _-l2 


□ 


The above lemmas allow us to determine the number of iterations that the MMCC algorithm 
needs for initial residual network H. 


12 



Theorem 3.11. The MMCC algorithm needs 2mn iterations for initial residual network H 
independent of the realization of the edge eosts. 


Proof. The proof is similar to the proof of Theorem |3.5[ Since cycles of type {d,Wi,a,u,v,d) 
have lower mean cost than both cycles of type (6, Xi, c, v, u, b) and of type {d, Wi, a, u, b, Xi, c, v, d) 


according to Lemma |3.71 and Lemma 3.9, the first m iterations will augment flow along cycles 
of type {d,Wi,a,u,v,d). After these m iterations, edges {d,wi) and (r(;i,ai) are saturated and 
the edges in E^v have positive residual capacity only in the direction from V io U. 

Since cycles of type (6, Xj, c, v, u, b) have lower mean cost than both cycles of type {d, Wi+i,a, u, v, d) 
and cycles of type (6, Xi, c, v, d, Wi+i,a, u, b) according to Lemma [3^ and Lemma 3.10, the next 
m iterations will augment flow along cycles of type {b,Xi,c,v,u,b). After the first 2m itera¬ 
tions, the residual network is the same as H, except that edges {d,wi), {wi,ai), (6,xi), and 
(xi,ci) are saturated and there is some flow on several edges of infinite capacity. The MMCC 
algorithm will keep augmenting along m cycles of type {d,Wi,a,u,v,d) followed by m cycles 
of type {b,Xi,c,v,u,b), until all edges of type {wi,ai) and type (xj,ci) are saturated. At this 
point no negative cycles remain in the residual network and the MMCC algorithm terminates 
after 2mn iterations. □ 


Initial residual network H and Theorem 13. Ill allow us to state a lower bound for the number 
of iterations that the MMCC Algorithm needs in the smoothed setting for large (j). 

Theorem 3.12. For every n > 4 and every m G {n,n -|- 1,... ,n^}, there exists an instance 
with 0(re) nodes and 0(m) edges, and (j) = 0(n^), for which the MMCC algorithm requires 
Q.{mn) iterations, independent of the realization of the edge costs. 


Proof. Follows directly from the instance H and Theorem 3.11 


□ 


4 Lower bound for the Network Simplex Algorithm 

In this section we provide a lower bound of kl(m ■ minjn, </>} • 4>) for the number of iterations 
that the Network Simplex (NS) algorithm requires in the setting of smoothed analysis. The 
instance of the minimum-cost flow problem that we use to show this lower bound is very similar 
to the instance used by Brunsch et al. to show a lower bound on the number of iterations 
that the Successive Shortest Path algorithm needs in the smoothed setting. The differences 
are that they scaled their edge costs by a factor of </>, which we do not, that we add an extra 
path from node s to node t, and that the budgets of the nodes are defined slightly differently. 
For completeness, we describe the construction of the (slightly adapted) instance by Brunsch 
et al. below. For a more elaborate description of the instance, we refer to the original paper. 
In the following we assume that all paths from s to t have pairwise different costs, which holds 
with probability 1, since the edge costs are drawn from continuous probability distributions. 

For given positive integers n, m G {n,... ,n^}, and cp < 2^ let k = [log(^J — 5 = 0{n) and 
M = min{n, 2 l-*°S 2 — 2} = 0(min{n, (/>}). Like Brunsch et al. we assume that (p > 64, 
which implies k,M > 1. We construct a flow network with 0(n) nodes, 0(m) edges, and 
smoothing parameter (p for which the NS algorithm needs 0(m • min{n, (p} ■ <p) iterations. The 
construction of the flow network G that we use to show our lower bound consists of three steps. 
First we define a flow network Gi. Next we describe how to obtain flow network Gi+i from 
flow network Gi. Finally, we construct G using Gk. As before, we give an interval of width at 
least l/(p for each edge from which its costs are drawn uniformly at random. 
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Figure 2: The instance G for which the MMCC algorithm needs Q{mn) iterations for n = 4, 
m = 9, and 4> = 400000n^. Next to the edges are the approximate edge costs. 


Construction of Gi. For the first step, consider two sets U = {ui,...,Un} and W = 
{rci,... ,Wn} of n nodes and an arbitrary set Ejjw C U x W containing exactly \Ei/w\ = m 
edges. The initial flow network Gi is defined as Gi = {Vi,Ei) for Fi = [/ U kF U and 

El = ({si} X 17) U Euw U (VF X {h}). 

The edges e from Ejjw have capacity 1 and costs from the interval Ig = The edges 

{si,Ui),Ui G U have capacity equal to the out-degree of Ui, the edges {wj,ti),Wj G VF have 
capacity equal to the in-degree of Wj and both have costs from the interval Ig = [Oj (see 
Figure]^. 


Construction of Gj+i from Gi. Now we describe the second step of our construction. 
Given a flow network Gi = {Vi,Ei), we define Gi+i = (Vi+i, Ei+i), where Vi+i = 14 U 
and 

Ei+i = EiU ({sj+i} X U ({si,ti} X {4+i}). 


Let Ni be the value of the maximum Sj-tj flow in the graph Gi. The new edges e G {(sj+i, Sj), (ti,ti+i)} 
have capacity u(e) = Ni and costs from the interval Ig = [0, ^]. The new edges e G {(sj+i, fi), (s*,L+i)} 


also have capacity u(e) = Ni, but costs from the interval Ig = [- 


91+3 


+ii 


(see Figure 


4). 


Construction of G from Gk- Let F be the value of a maximum s^-tk flow in Gk- We will 
now use Gk to define G = [V,E) as follows (see also Figure]^. 

• V := Vk U AU B U C U D U Q U with A := {ai, 02,..., aiw}, B := {bi, 62, ... , I'm}, 
G := {ci,C2,...,cm}, D := {+, ^2, • • •, cIm}, and Q := {qi,q2, ■ ■ ■ ,q2M}- 


14 






















Figure 3: Example for Gi with n = 3 and m = 7 with capacities different from 1 shown next to 
the edges and the cost intervals shown below each edge set. Dashed edges are in the 
initial spanning tree for the network simplex algorithm, while solid edges are not. 



Figure 4: Gj+i with Gi as sub-graph with edge costs next to the edges. Dashed edges are in 
the initial spanning tree for the network simplex algorithm, while solid edges are not. 
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• E := EkU EalJ EcU EdlJ Eg. 

ofc+5 1 ofc+5 

• Ea contains the edges (aj,ai_i), i G {2,...,M}, with cost interval [— 

infinite capacity; {s,ai), i G M}, with cost interval [0,^] and capacity E; and 

(oi, Sk) with cost interval infinite capacity. 

• Ef, contains the edges (6^, i G {2,..., M}, with cost interval and infi¬ 

nite capacity; (s, 6j), i G {1,..., M}, with cost interval [0, and capacity F; and (6i, tfc) 
with cost interval infinite capacity. 

• Ec contains the edges (ci_i,Cj), i G {2,... ,M}, with cost interval and infi¬ 

nite capacity; {ci,t), i G {1,..., M}, with cost interval [0, and capacity E; and (s^, ci) 
with cost interval and infinite capacity. 

o fe I 5 1 o /c I 5 

• Ed contains the edges {di-i,di), i G {2, ...,M}, with cost interval [—and 

infinite capacity; {di,t), i G M}, with cost interval [0,^] and capacity F; and 

{tk,di) with cost interval and infinite capacity. 

• Eq contains the edges {qi-i, Qi), i G {2,..., 2M}, with cost interval and in¬ 
finite capacity; {s,qi), with cost interval and infinite capacity; and {q 2 M,t) 

with cost interval infinite capacity. 

The budgets of all nodes are 0, except for node s and t, which have budgets b{s) = 2ME 
and b{t) = —2MF. We choose as the initial spanning tree T for the NS algorithm the edges 

• (si,Mi) (i = 1 ,... ,n) and (i = 1,... ,n), 

• (si+i,Si) (i = 1,..., A: - 1) and {ti,ti+i) {i = I,..., k - 1), 

• (s,ai), (ai,aj_i) (i = 2,...,M), and (ai,Sfc), 

• (sfc, Cl) and (cj_i, Cj) (z = 2,..., M), 

• (bi, bi-i) {i = 2,... ,M) and (6i, 4), 

• idi,t), {tk,di), and {di-i,di) {i = 2,.. .,M), 

• is,qi), {qi,qi+i) {i = 1,... ,2M - 1), and {q 2 M,t). 

(see Figure Figure and Figure [^. In addition, L = E\T and 11 = %. Spanning 
tree structure {T,L,U) corresponds to the flow that sends 2MF units of flow on the path 
(s, gi,..., q 2 M, t) and does not send any flow on other edges. 

To prove our lower bound on the number of iterations that the NS algorithm needs for flow 
network G, we link the iterations of the NS algorithm to the iterations of the Successive Shortest 
Path (SSP) algorithm for flow network H, where H is obtained from G by removing all nodes 
qi {i = 1,..., 2M) and all their incident edges. As shown by Brunsch et al. Theorem 23], the 
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capacity F 
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capacity oo 
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0000 


Qi - -> Q2 - -> Q3 - -> Qi - -> Qb - -> Qe - -> Q7 - -> Qs - -> Qd - -> Qio 

2^+5 2^4‘5 2^+5 2^^"^ 2^“I“5 ^k-\-5 2^“I"5 i2k-\-5 2^4‘5 


0 0 


4> 4> <f> 4i 


capacity oo 


Figure 5: G with as sub-graph with approximate edge costs next to the edges. Dashed 
edges are in the initial spanning tree for the network simplex algorithm, while solid 
edges are not. 


SSP algorithm needs Q{m ■ min{n, (p} ■ (p) iterations for H. We show that each non-degenerate 
iteration of the NS algorithm on G corresponds with an iteration of the SSP algorithm on H 
and that therefore the NS algorithm needs D(m • min{n, <p} ■ <p) iterations for flow network G 
as well. In our analysis we use many results on the costs of paths in H by Brunsch et al. [^. 
We will not prove these results again, but refer to the original paper. 

We now first observe that path (s, qi,..., q 2 M, t) is the most expensive path from s to t in 
G. Since paths from Sk to tk have cost less than Lemma 21], the most expensive s — t 

paths that do not use the nodes qi are (s, um, • • ■, cm, t) and (s, Um, ■ ■ ■, dM,t). Both those 
paths have the same distribution for their costs. If we arbitrarily choose path (s, um, ■ ■ ■, CM,t) 
and compare its cost with the cost of path ( 5 ,( 71 ,... ,q 2 M,'t), assuming worst case edge cost 
realizations, we have 


c(s, ( 71 , ... , q2M, t) — c(s, ttM, ■ • • , CM,t) 

2k+^-l\ /(2M-1)2*=+5-P2'=+4-F2 


> (2M -F 1) 




3 • 2^+^ - 2M - 3 


- > 0 , 


by the definition of k and M and cp > 64, which shows that path (s, gi,..., q 2 M, t) is the most 
expensive path from s to t in G. 

Since (s, ( 71 ,..., q 2 M, t) is the most expensive s — t path, we can obtain a negative cycle G 
by combining the reverse edges of (s, gi,..., q 2 M,t) with the edges of another s — t path P. 
Clearly, the cheaper P is, the cheaper is G. In addition, if all edges of G except for one edge 
{i,j) are in the current spanning tree T, then edge {i,j) has reduced cost equal to the cost of 
G, since all edges in T have reduced cost 0. Using these two observations, we can conclude 
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that as long as all negative cycles that can be formed using edges in the current spanning tree 
T plus one edge outside of T that has positive residual capacity in the current residual network 
consist of path (t, q 2 M, ■ ■ ■ ,qi,s) plus an s — t path P, then the NS algorithm with the most 
negative edge pivot rule will pivot on the edge that together with the edges in T forms the 
cheapest s — t path. 

Brunsch et al. showed that the SSP algorithm encounters 2MF paths on H. Let 
Pi,---,P 2 MF be the paths encountered on H by the SSP algorithm, ordered from cheap¬ 
est to most expensive. We refer the reader to the paper of Brunsch et al. for a description 
of these paths. In the following we show that in the non-degenerate iteration of the NS 

algorithm on G, flow is sent along cycle (t, q 2 M, ■ ■ ■ ,qi, s) Li Pi. We will not provide all the 
calculations needed to compare the reduced cost of the candidate edges for addition to the 
spanning tree in each iteration, but it can be checked by tedious computation that the claimed 
edge is indeed the one with lowest reduced cost. If flow is sent over one of the edges in Ejjw, 
that edge is always a candidate leaving edge, since edges in Ejjw have capacity 1 and all other 
capacities are integral. For simplicity we assume that in cases where multiple edges in the 
cycle become saturated simultaneously by the flow augmentation, it is always the edge from 
Euw that leaves the spanning tree. 

In the first iteration of the NS algorithm on G, all edges in Euw have negative reduced cost 
and positive residual capacity, and the edge {ui,Wj) that together with the starting spanning 
tree T contains path Pi is added to T, flow is augmented along cycle (t, q 2 M, ■ ■ ■ ,qi, s) U Pi, 
and {ui,Wj) becomes saturated and leaves T again. For the second iteration, edge {ui,Wj) is 
saturated and therefore the edge that together with T contains P 2 will be added to T. This 
will continue for the first m iterations. 

At this point, the cheapest s — t path using edges in T plus one edge with positive residual 
capacity is the path that is obtained by using either edge (s 2 ,ti) or edge (si,t 2 ) (depending 
on the realization of the edge costs). The next two iterations will therefore be degenerate. 
In one of these iterations {s 2 ,ti) is added to the spanning tree, but edge (ti,t 2 ) is saturated, 
prevents any flow being sent along the cycle, and is therefore removed from the spanning tree. 
In the other iteration {si,t 2 ) is added to the spanning tree and (s 2 ,si) is removed. After 
these two iterations the edges in Pf/iy become eligible for augmenting flow in the reverse 
direction and the next m iterations augment flow along the cycles {t, q 2 M, ■ ■ ■ ,qi,s) LI Pm+i, 
... , {t,q2M, ■ ■ ■ ,qi,s)U P2m. 

Analogously to the above, every time an edge (si+i,Sj) gets saturated, two degenerate iter¬ 
ations take place in which edges (sj,ti+i) and (si+i,ti) are added to the spanning tree. This 
allows flow to be sent through Gi in reverse direction, that is, from L to s*. Similarly, every 
time an edge (that is, the reverse edge of an original edge (si,L+i)) in the resid¬ 

ual network gets saturated, two degenerate iterations take place in which edges (L,ti+i) and 
(sj+i. Si) are added to the spanning tree. 

After P iterations, there are no paths with positive residual capacity from Sk to At this 
point another two degenerate iterations take place. In one of them edge (ci,t) is added to 
the spanning tree, but no flow is sent since (s,ai) has zero residual capacity and therefore 
(s,ai) leaves the spanning tree. In the other iteration (s,6i) is added to the spanning tree 
and {dip) leaves. Now the edges in Euw can be added to the spanning tree again and flow is 
augmented along them in reverse direction during the next m iterations. In particular, in the 
next iteration flow is augmented along cycle (t, q 2 M, ■ ■ ■ ,qi, s) LI Pp+i- 

Analogously to the above, every time an edge (s, Oj) gets saturated, two degenerate iterations 
take place. In one of them (cj, t) enters the spanning tree and (s, Oj) leaves. In the other (s, bi) 
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enters the spanning tree and {di,t) leaves. Also, every time an edge {s,bi) gets saturated, two 
degenerate iterations take place. In one of them {di+i,t) enters the spanning tree and (s,6i) 
leaves. In the other (s,aj+i) enters the spanning tree and {ci,t) leaves. 

Finally, after 2MF iterations none of the edges not in the spanning tree has both negative 
reduced cost and positive residual capacity and therefore the NS algorithm terminates. From 
the above discussion we can conclude that the NS algorithm on G needs 2MF non-degenerate 
iterations plus several degenerate ones. 

Theorem 4.1. For flow network G and initial spanning tree structure {T,L,U), the NS algo¬ 
rithm needs 2MF non-degenerate iterations with probability 1. 


Proof. Follows immediately from the discussion above. 


□ 


Theorem 4.1 provides a lower bound for the number of iterations that the NS algorithm 
needs in the smoothed setting. 


Theorem 4.2. For every n, every m G {n, ...,n^}, and every (/> < 2” there exists a flow 
network with Q(n) nodes and Q{m) edges, and an initial spanning tree structure for which the 
Network Simplex algorithm needs n(m-min{n, <^} •<?!)) non-degenerate iterations with probability 

1 . 


Proof. Follows directly from Theorem 4.1 and the dehnition of M and F. 


□ 


5 Discussion 


In Section we showed a smoothed lower bound of • min{n, (/)} ■ (f) for the number of 
iterations that the NS algorithm needs. This bound is the same as the smoothed lower bound 
that Brunsch et al. showed for the SSP algorithm. For the SSP algorithm this lower bound 
is even tight in case </> = 0(n). Still, the NS algorithm is usually much faster in practice than 
the SSP algorithm. We believe that the reason for this difference is that the time needed 
per iteration is much less for the NS algorithm than for the SSP algorithm. In practical 
implementations, the entering edge is usually picked from a small subset (for example of size 
Q{^/m)) of the edges, which removes the necessity of scanning all edges for the edge which 
maximally violates its optimality conditions. Also, the spanning tree structure allows for fast 
updating of the flow and node potentials, in particular when the flow changes on only a small 
fraction of the edges. For the SSP algorithm an iteration consists of finding a shortest path, 
which takes 0{m + nlog(n)) time. The experimental results of Kovacs [15] seem to support 
this claim, since on all test instances the SSP algorithm is slower than the NS algorithm, but 
never more than a factor m. To allow a better comparison of the SSP algorithm and the NS 
algorithm in the smoothed setting, it would be useful to have a smoothed upper bound on the 
running time of the NS algorithm. Finding such an upper bound is our main open problem. 


There is a gap between our smoothed lower bound of n(m log((/>)) (Section 3.1) for the 
number of iterations that the MMCC algorithm requires and our smoothed upper bound of 
0(mn^ log(n) log((/))). Since our lower bound for the MMCC algorithm is weaker than the 
lower bound for the SSP algorithm, while the MMCC algorithm performs worse on practical 
instances than the SSP algorithm, we believe that our lower bound for the MMCC algorithm 
can be strengthened. Our stronger lower bound of Pl{mn) in case (f = O(n^) (Section 3.2) is 
another indication that this is likely possible. 
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